11 research outputs found

    Federated Multi-Armed Bandits

    Full text link
    Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated learning (FL) framework in supervised learning. It is inspired by practical applications in cognitive radio and recommender systems, and enjoys features that are analogous to FL. This paper proposes a general framework of FMAB and then studies two specific federated bandit models. We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution. This model introduces a new uncertainty of client sampling, as the global model may not be reliably learned even if the finite local models are perfectly known. Furthermore, this uncertainty cannot be quantified a priori without knowledge of the suboptimality gap. We solve the approximate model by proposing Federated Double UCB (Fed2-UCB), which constructs a novel "double UCB" principle accounting for uncertainties from both arm and client sampling. We show that gradually admitting new clients is critical in achieving an O(log(T)) regret while explicitly considering the communication cost. The exact model, where the global bandit model is the exact average of heterogeneous local models, is then studied as a special case. We show that, somewhat surprisingly, the order-optimal regret can be achieved independent of the number of clients with a careful choice of the update periodicity. Experiments using both synthetic and real-world datasets corroborate the theoretical analysis and demonstrate the effectiveness and efficiency of the proposed algorithms.Comment: AAAI 2021, Camera Ready. Code is available at: https://github.com/ShenGroup/FMA

    Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

    Full text link
    Offline reinforcement learning (RL) aims at learning an optimal strategy using a pre-collected dataset without further interactions with the environment. While various algorithms have been proposed for offline RL in the previous literature, the minimax optimal performance has only been (nearly) achieved for tabular Markov decision processes (MDPs). In this paper, we focus on offline RL with linear function approximation and propose two new algorithms, SPEVI+ and SPMVI+, for single-agent MDPs and two-player zero-sum Markov games (MGs), respectively. The proposed algorithms feature carefully crafted data splitting mechanisms and novel variance-reduction pessimistic estimators. Theoretical analysis demonstrates that they are capable of matching the performance lower bounds up to logarithmic factors. As a byproduct, a new performance lower bound is established for MGs, which tightens the existing results. To the best of our knowledge, these are the first computationally efficient and nearly minimax optimal algorithms for offline single-agent MDPs and MGs with linear function approximation

    OCCS Classification and Treatment Algorithm for Comminuted Mandibular Fractures Based on 109 Patients and 11 Years Experiences: A Retrospective Study

    No full text
    Comminuted mandibular fractures (CMFs) pose significant challenges to surgeons for their serious complications and poor outcomes. We aimed at proposing a classification with treatment algorithm of each category for CMFs. Patients with CMFs were retrospectively reviewed and classified into five categories: Type I: relatively good occlusion, no or slightly displaced fragments, no continuity destruction or bone defect; Type II: relatively good occlusion, damaged morphology, low comminution degree but intact continuity without bone defect; Type III: damaged morphology and higher comminution degree with intact continuity and relatively good occlusion; Type IV: high comminution, impaired continuity and poor occlusion without segmental bone defect; Type V: segmental bone defect. Conservative treatment, open reduction and internal fixation or microvascular osteocutaneous free flap transplantation was performed, accordingly. Demographics, perioperative data, complications and reasons for reoperations were recorded. The chi-square test was used for statistical analysis. In total, 109 patients were included in the study. After surgery, in the following group, 5 manifested infections, 1 manifested bone non-union, and 2 experienced reoperations, while in the unfollowing group, 10 manifested infections, 5 manifested bone non-union and 8 experienced reoperations. The OCCS classification and algorithm for CMFs achieve better outcomes and with lower complication rate

    Enhanced light-field image resolution via MLA translation

    No full text
    This work describes a method that effectively improves the spatial resolution of light-field images without sacrificing angular resolution. The method involves translating the microlens array (MLA) linearly in both x- and y-directions in multiple steps to achieve 4 ×, 9 ×, 16 × and 25 × spatial resolution improvements. Its effectiveness was firstly validated through simulations with synthetic light-field images, demonstrating that distinct spatial resolution increments can be achieved by shifting the MLA. An MLA-translation light-field camera was built based on an industrial light-field camera, with which detailed experimental tests were carried out on a 1951 USAF resolution chart and a calibration plate. Qualitative and quantitative results prove that MLA translations can significantly improve measurement accuracy in x- and y- directions while preserving z-direction accuracy. Finally, the MLA-translation light-field camera was used to image a MEMS chip to demonstrate that finer structures of the chip can be acquired successfully.Published versionThis work was funded by National Natural Science Foundation of China (12172222) ; Aero Engine Corporation of China (HFZL2020 CXY014-2) ; Fundamental Research Funds for the Central Universities

    Status of lead accumulation in agricultural soils across China (1979-2016)

    No full text
    The first national-scale assessment of lead (Pb) contamination in agricultural soils across China was conducted based on > 1900 articles published between 1979 and 2016. Pb concentrations, temporal and spatial variations, and influencing factors were analyzed. Children's blood lead levels (BLLs) were also estimated using the integrated exposure uptake biokinetic (IEUBK) model. Pb concentrations in different areas of China varied greatly, which was closely associated with the distribution of Pb-related industries, especially Pb-zinc mine smelling, non-ferrous polymetallic mine smelting, e-waste recycling, and leaded gasoline consumption. The year 2000 was a significant transition year for Pb concentrations, with a rapid increase pre-2000 and a subsequent slow upward trend. Pb concentrations were found to be strongly associated with indicators of economic and social development including gross domestic product (GDP), population size, and vehicle ownership. Leaded gasoline, coal combustion, and non-ferrous smelling were the main sources of atmospheric Pb during the different periods. Predicted BLLs were higher in South China than those in the north. This study details the overall Pb contamination status of agricultural soils in China, and thus provides insights for policymakers with respect to pollution prevention measures
    corecore